
SEQUENCE LISTING 



<110> Dunn-Coleman, Nigel 
Goedegebuur, Frits 
Ward, Michael 
Yao, Jian 

<120> EGVI Endoglucanase and Nucleic Acids 
Encoding the Same 

<130> GC698 

<140> US 10/026,994 
<141> 2001-12^8 



»(3' 



<1605 

<170> FastSEQ for Windows Version 4.0 



<210> 1 
<211> 2710 
<212> DNA 

<213> Trichoderma reesei 



<400> 1 

ccacgcgtcc gagcagtgtc ctcctcctca ctgcttcgtc atgaaggtct ctcgagtcct 60 

tgcccttgtc ctgggggccg tcatccctgc ccatgctgcc ttttcatgga agaacgtcaa 120 

gctcggcggc ggcggcggct tcgtccccgg catcatcttc catcccaaga caaaaggcgt 180 

agcatatgca cgaacagata ttggcgggct gtaccgcctc aacgccgacg actcatggac 240 

cgccgtcacg gatgggattg ctgataatgc cggctggcac aactggggca tcgacgctgt 300 

tgcgcttgat ccgcaggacg atcaaaaggt gtatgccgca gtcggcatgt atacgaacag 360 

ctgggatccg agtaatggag ccatcattcg ctcgtcagac cgcggcgcaa cgtggtcctt 420 

caccaacttg cccttcaaag tcgggggtaa catgccagga cgcggagccg gagagcgtct 480 

ggctgtcgat ccggccaact ccaacatcat ctactttggt gctcgctcag gaaacggcct 540 

ctggaagtct acggacggcg gcgtgacctt ttccaaggtc tcgtcgttca cggcaactgg 600 

gacgtacatc ccagacccga gtgattccaa cggctacaac agcgacaagc aaggactcat 660 

gtgggttacg ttcgactcaa ccagcagcac gaccggggga gccacgtctc gtatctttgt 720 

tggcacggct gataacatca ctgcttcagt ctatgtgagc acgaatgccg gctccacgtg 780 

gagtgctgta ccggggcagc cagggaaata ctttcctcac aaggcgaaac tgcagccagc 84 0 

agagaaggcc ttgtatctga cctattccga tggcacaggg ccgtatgatg gcacacttgg 900 

ctcagtgtgg aggtacgaca ttgcaggggg aacttggaaa gacatcaccc ctgtctctgg 960 

atcagatcta tactttggct ttggcggcct tggcctcgat ttgcaaaagc caggaaccct 1020 

tgttgttgct tctttgaact cttggtggcc agatgctcag ctgtttcggt cgaccgactc 1080 

tgggacaaca tggagcccga tctgggcgtg ggcgagctat ccgactgaga cctattacta 1140 

cagcatctca actcccaaag caccgtggat caagaacaac tttatcgatg tgacgagcga 1200 

gtcaccgtcc gatggtctca tcaagcgcct cggctggatg attgagtctc tcgagattga 1260 

cccaaccgac agcaaccact ggctctacgg caccggaatg acaatctttg gcggccacga 1320 

tctcaccaac tgggacacgc gccacaatgt gtcaatccaa tcactggcag acggcatcga 1380 

ggaattctcc gtccaggacc tggcctctgc acccggcgga agcgagctat tggccgcagt 1440 

cggagacgac aacggcttca cctttgccag cagaaacgac ctcgggacat cgccgcagac 1500 

ggtctgggca acgcccacat gggccacctc gacgagcgtc gactacgccg ggaactcggt 1560 

caagagcgtc gtccgcgtcg gcaacaccgc cggcacgcaa caggtggcca tctcgtccga 1620 

cggcggcgcg acgtggagca tcgactacgc ggccgacacg tccatgaacg gcggcacggt 1680 

ggcctattcg gccgacggcg acacgatcct ctggtcgacc gcctcgtccg gcgtgcagcg 1740 

ctcgcagttc cagggcagct ttgcctccgt ctcgagcctg cccgcgggcg ccgtcatcgc 1800 

ctcggacaag aagaccaaca gcgtcttcta cgccggctcc ggatcgacct tttacgtcag 1860 

caaggacacc ggcagcagct tcacgcgcgg gcccaagctg ggcagcgcag ggacgatccg 1920 

ggatatcgct gctcacccga ccaccgcggg cacgttgtat gtctcgaccg acgtcggcat 1980 

attccgctcc acagactcgg gcacgacctt tggccaagtc tccaccgccc tgaccaacac 2040 



1 



ctaccagatc 

cggcccgtca 
ccagggctcc 
cgccgggcaa 
cggcggcggc 
cgccagctcg 
gacttcgtcg 
tgctcagtgc 
ccagaagcag 
ggagagctac 
agtagagatc 
aaaaaaaaaa 



gccctgggtg 

ggggctcgcc 
cagggcttcg 
gtctacgtgg 
acgggcggga 
agcaccacgc 
aggaccagct 
ggagggattg 
aatgattatt 
atacccctag 
cagtcagata 



tgggctcagg 

tctacgccag 
gctccatcga 
gcaccaacgg 
cttcctcgtc 
tgaggtcgag 
cggccgccgg 
ggtggacggg 
actaccagtg 
gctcgcagta 
attatccatt 



ctcgaactgg 

tggagacagc 
cagcaccaag 
ccggggcgtc 
gaccaagcag 
cgttgtatcc 
tcccacgggg 
gccgacgcag 
tgtgtgatgc 
aagagctcaa 
tgtttgaatt 



aacctgtatg 
ggcgcctcct 
gtcgccggca 
ttttacgctc 
agcagcagca 
acgacccggg 
tcaggggtcg 
tgtgtggcgc 
ttgaactgcc 
gcatccgaag 
aaatgatctt 



ccttcggcac 

ggacggacat 
gcggcagcac 
agggaaccgt 
gtacctcttc 
cttcgacggt 
ccggtcatta 
cgtatgtctg 
aagctcacga 
aagcactagt 
ctattgaaaa 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2710 



<210> 2 
<211> 818 
<212> PRT 

<213> Trichoderma reesei 



<400> 2 



Ala Phe 


Ser 


Tro 


Lvs 


Asn 


Val 


Lvs 


Leu 


Gly 


Gly 


Gly 

%rf J. jr 


Gly 


Gly 


Phe 


Val 


1 






5 










10 










15 




Pro Gly 


He 


He 


Phe 


His 


Pro 


Lvs 


Thr 


Lvs 


Gly 


Val 


Ala 


Tyr 


Ala 


Ara 






20 










25 










30 






Thr Asp 


He 


Gly 


Gly 


Leu 


Tyr 


Ara 


Leu 


Asn 


Ala 


Asp 


Asp 


Ser 


Trp 


Thr 




35 










40 










45 








Ala Val 


Thr 


Asp 


Gly 


He 


Ala 


Asp 


Asn 


Ala 


Gly 


Trp 


His 


Asn 


Trp 


Gly 


50 










55 










60 










lie Asp 


Ala 


Val 


Ala 


Leu 


Asp 


Pro 


Gin 


Asp 


Asp 


Gin 


Lys 


Val 


Tyr 


Ala 


65 








70 










75 










80 


Ala Val 


Gly 


Met 


Tyr 


Thr 


Asn 


Ser 


Trp 


Asp 


Pro 


Ser 


Asn 


Gly 


Ala 


He 








85 










90 










95 




He Arg 


Ser 


Ser 


Asp 


Arg 


Gly 


Ala 


Thr 


Trp 


Ser 


Phe 


Thr 


Asn 


Leu 


Pro 






100 










105 










110 






Phe Lys 


Val 


Gly 


Gly 


Asn 


Met 


Pro 


Gly 


Arg 


Gly 


Ala 


Gly 


Glu 


Arg 


Leu 




115 










120 










125 








Ala Val 


Asp 


Pro 


Ala 


Asn 


Ser 


Asn 


He 


He 


Tyr 


Phe 


Gly 


Ala 


Arg 


Ser 


130 










135 










140 










Gly Asn 


Gly 


Leu 


Trp 


Lys 


Ser 


Thr 


Asp 


Gly 


Gly 


Val 


Thr 


Phe 


Ser 


Lys 


145 








150 










155 










160 


Val Ser 


Ser 


Phe 


Thr 


Ala 


Thr 


Gly 


Thr 


Tyr 


He 


Pro 


Asp 


Pro 


Ser 


Asp 








165 










170 










175 




Ser Asn 


Gly 


Tyr 


Asn 


Ser 


Asp 


Lys 


Gin 


Gly 


Leu 


Met 


Trp 


Val 


Thr 


Phe 






180 










185 










190 






Asp Ser 


Thr 


Ser 


Ser 


Thr 


Thr 


Gly 


Gly 


Ala 


Thr 


Ser 


Arg 


He 


Phe 


Val 




195 










200 










205 








Gly Thr 


Ala 


Asp 


Asn 


He 


Thr 


Ala 


Ser 


Val 


Tyr 


Val 


Ser 


Thr 


Asn 


Ala 


210 










215 










220 










Gly Ser 


Thr 


Trp 


Ser 


Ala 


Val 


Pro 


Gly 


Gin 


Pro 


Gly 


Lys 


Tyr 


Phe 


Pro 


225 








230 










235 










240 


His Lys 


Ala 


Lys 


Leu 


Gin 


Pro 


Ala 


Glu 


Lys 


Ala 


Leu 


Tyr 


Leu 


Thr 


Tyr 








245 










250 










255 




Ser Asp 


Gly 


Thr 


Gly 


Pro 


Tyr 


Asp 


Gly 


Thr 


Leu 


Gly 


Ser 


Val 


Trp 


Arg 






260 










265 










270 






Tyr Asp 


He 


Ala 


Gly 


Gly 


Thr 


Trp 


Lys 


Asp 


He 


Thr 


Pro 


Val 


Ser 


Gly 




275 










280 










285 








Ser Asp 


Leu 


Tyr 


Phe 


Gly 


Phe 


Gly 


Gly 


Leu 


Gly 


Leu 


Asp 


Leu 


Gin 


Lys 


290 










295 










300 










Pro Gly 


Thr 


Leu 


Val 


Val 


Ala 


Ser 


Leu 


Asn 


Ser 


Trp 


Trp 


Pro 


Asp 


Ala 


305 








310 










315 










320 


Gin Leu 


Phe 


Arg 


Ser 


Thr 


Asp 


Ser 


Gly 


Thr 


Thr 


Trp 


Ser 


Pro 


He 


Trp 



2 



Ala 


Trp 


Ala 


Ser 








340 


Pro 


Lvs 


Ala 


Pro 






355 




Ser 


Pro 


Ser 


Asp 




370 






Leu 


Glu 


He 


Asp 


385 








Met 


Thr 


He 


Phe 


Asn 


Val 


Ser 


He 








420 


Gin Asp 


Leu 


Ala 






435 




Gly Asp 


Asp 


Asn 




450 






Ser 


Pro 


Gin 


Thr 


465 








Val 


Asp 


Tvr 


Ala 


Thr 


Ala 


Gly 


Thr 








500 


Ser 


He 


Asp 


Tyr 






515 




Tyr 


Ser 


Ala 


Asp 




530 






Val 


Gin 


Arg 


Ser 


545 








Pro 


Ala 


Gly 


Ala 


Tyr Ala 


Gly 


Ser 








580 


Ser 


Phe 


Thr 


Arg 






595 




He 


Ala 


Ala 


His 




610 






Val 


Gly 


He 


Phe 


625 








Ser 


Thr 


Ala 


Leu 


Gly Ser 


Asn 


Trp 








660 


Arg 


Leu 


Tvr 


Ala 






675 




Gly 


Ser 


Gin 


Glv 
^ J. J 




690 






Gly 


Ser 


Thr 


Ala 


705 








Phe 


Tyr 


Ala 


Gin 


Ser 


Thr 


Lvs 


Gin 








740 


Thr 


Leu 


Arg 


Ser 






755 




Ser 


Ser 


Arg 


Thr 




770 






Gly 


His 


Tyr 


Ala 


785 








Cys 


Val 


Ala 


Pro 



325 



Tyr 


Pro 


Thr 


Glu 


Trp 


He 


Lys 


Asn 








360 


Gly 


Leu 


He 


Lys 






375 




Pro 


Thr 


Asp 


Ser 




390 






Gly 


Gly 


His 


Asp 


405 








Gin 


Ser 


Leu 


Ala 


Ser 


Ala 


Pro 


Gly 








440 


Gly 


Phe 


Thr 


Phe 






455 




Val 


Trp 


Ala 


Thr 




470 






Glv 


Asn 


Ser 


Val 


485 








Gin 


Val 


Ala 


He 


Ala 


Ala 


Asp 


Thr 








520 


Gly 


Asp 


Thr 


He 






535 




Gin 


Phe 


Gin 


Glv 




550 






Val 


He 


Ala 


Ser 


565 








Gly 


Ser 


Thr 


Phe 


Gly 


Pro 


Lys 


Leu 








600 


Pro 


Thr 


Thr 


Ala 






615 




Arg 


Ser 


Thr 


Asp 




630 






Thr 


Asn 


Thr 


Tyr 


645 








Asn 


Leu 


Tyr 


Ala 


Ser 


Gly 


Asp 


Ser 








680 


Phe 


Glv 


Ser 


He 






695 




Glv 
w J. jf 


Gin 


Val 


Tvr 




710 






Gly 


Thr 


Val 


Gly 


725 








Ser 


Ser 


Ser 


Ser 


Ser 


Val 


Val 


Ser 








760 


Ser 


Ser 


Ala 


Ala 






775 




Gin 


Cys 


Gly 


Gly 




790 






Tyr 


Val 


Cys 


Gin 





330 






Thr 


Tvr 


Tvr 

* Jr ■*■ 


Tvr 


345 








Asn 


Phe 


He 


Asp 


Ara 


Leu 


Glv 


Trp 








380 


Asn 


His 


Trp 


Leu 






395 




Leu 


Thr 


Asn 


Trp 




410 






Asp 


Glv 

^ Jr 


He 


Glu 


425 








Gly 


Ser 


Glu 


Leu 


Ala 


Ser 


Arg 


Asn 








460 


Pro 


Thr 


Trp 


Ala 






475 




Lys 


Ser 


Val 


Val 




490 






Ser 


Ser 


Asp 


Glv 

Jr 


505 








Ser 


Met 


Asn 


Glv 


Leu 


Trp 


Ser 


Thr 








540 


Ser 


Phe 


Ala 


Ser 






555 




Asp 


Lvs 


Lvs 

Jr ^ 


Thr 




570 






Tvr 


Val 


Ser 


Lvs 

Jr 


585 








Gly 


Ser 


Ala 


Glv 


Gly 


Thr 


Leu 


Tyr 








620 


Ser 


Gly 


Thr 


Thr 






635 




Gin 


He 


Ala 


Leu 




650 






Phe 


Glv 


Thr 


Glv 


665 








Glv 


Ala 


Ser 


Trp 


Asp 


Ser 


Thr 


Lvs 








700 


Val 


Gly 


Thr 


Asn 






715 




Glv 


Glv 

VJ J. J 


Thr 


Glv 




730 






Thr 


Ser 


Ser 


Ala 


745 








Thr 


Thr 


Arg 


Ala 


Gly 


Pro 


Thr 


Gly 








780 


He 


Gly 


Trp 


Thr 






795 




Lys 


Gin 


Asn 


Asp 







335 




Ser 


He 


Ser 


Thr 




350 






Val 


Thr 


Ser 


Glu 


365 








Met 


He 


Glu 


Ser 


Tvr 


Glv 


Thr 


Glv 
w J. 








400 


Asp 


Thr 


Arg 


His 






415 




Glu 


Phe 


Ser 


Val 




430 






Leu 


Ala 


Ala 


Val 


445 








Asp 


Leu 


Gly Thr 


Thr 


Ser 


Thr 


Ser 








480 


Arg 


Val 


Gly Asn 






495 




Glv 


Ala 


Thr 


Trp 




510 






Glv 


Thr 


Val 


Ala 


525 








Ala 


Ser 


Ser 


Gly 


Val 


Ser 


Ser 


Leu 








560 


Asn 


Ser 


Val 


Phe 






575 




Asp 


Thr 


Gly 


Ser 




590 






Thr 


He 


Arg Asp 


605 








Val 


Ser 


Thr Asp 


Phe 


Glv 


Gin 


Val 








640 


Gly 


Val 


Gly 


Ser 






655 




Pro 


Ser 


Gly Ala 




670 






Thr 


Asp 


He 


Gin 


685 








Val 


Ala 


Gly 


Ser 


Glv 

^ J. jr 


Arg 


Gly Val 








720 


Gly 


Thr 


Ser 


Ser 






735 




Ser 


Ser 


Ser 


Thr 




750 






Ser 


Thr 


Val 


Thr 


765 








Ser 


Gly 


Val 


Ala 


Gly 


Pro 


Thr 


Gin 








800 


Tyr 


Tyr 


Tyr 


Gin 



805 810 815 

Cys Val 



<210> 3 
<211> 19 
<212> PRT 

<213> Trichoderma reesei 
<400> 3 

Met Lys Val Ser Arg Val Leu Ala Leu Val Leu Gly Ala Val lie Pro 

15 10 15 

Ala His Ala 



<210> 4 
<211> 2517 
<212> DNA 

<213> Trichoderma reesei 



<400> 4 

atgaaggtct ctcgagtcct tgcccttgtc ctgggggccg tcatccctgc ccatgctgcc 60 

ttttcatgga agaacgtcaa gctcggcggc ggcggcggct tcgtccccgg catcatcttc 120 

catcccaaga caaaaggcgt agcatatgca cgaacagata ttggcgggct gtaccgcctc 180 

aacgccgacg actcatggac cgccgtcacg gatgggattg ctgataatgc cggctggcac 240 

aactggggca tcgacgctgt tgcgcttgat ccgcaggacg atcaaaaggt gtatgccgca 300 

gtcggcatgt atacgaacag ctgggatccg agtaatggag ccatcattcg ctcgtcagac 360 

cgcggcgcaa cgtggtcctt caccaacttg cccttcaaag tcgggggtaa catgccagga 420 

cgcggagccg gagagcgtct ggctgtcgat ccggccaact ccaacatcat ctactttggt 480 

gctcgctcag gaaacggcct ctggaagtct acggacggcg gcgtgacctt ttccaaggtc 540 

tcgtcgttca cggcaactgg gacgtacatc ccagacccga gtgattccaa cggctacaac 600 

agcgacaagc aaggactcat gtgggttacg ttcgactcaa ccagcagcac gaccggggga 660 

gccacgtctc gtatctttgt tggcacggct gataacatca ctgcttcagt ctatgtgagc 720 

acgaatgccg gctccacgtg gagtgctgta ccggggcagc cagggaaata ctttcctcac 780 

aaggcgaaac tgcagccagc agagaaggcc ttgtatctga cctattccga tggcacaggg 840 

ccgtatgatg gcacacttgg ctcagtgtgg aggtacgaca ttgcaggggg aacttggaaa 900 

gacatcaccc ctgtctctgg atcagatcta tactttggct ttggcggcct tggcctcgat 960 

ttgcaaaagc caggaaccct tgttgttgct tctttgaact cttggtggcc agatgctcag 1020 

ctgtttcggt cgaccgactc tgggacaaca tggagcccga tctgggcgtg ggcgagctat 1080 

ccgactgaga cctattacta cagcatctca actcccaaag caccgtggat caagaacaac 1140 

tttatcgatg tgacgagcga gtcaccgtcc gatggtctca tcaagcgcct cggctggatg 1200 

attgagtctc tcgagattga cccaaccgac agcaaccact ggctctacgg caccggaatg 1260 

acaatctttg gcggccacga tctcaccaac tgggacacgc gccacaatgt gtcaatccaa 1320 

tcactggcag acggcatcga ggaattctcc gtccaggacc tggcctctgc acccggcgga 1380 

agcgagctat tggccgcagt cggagacgac aacggcttca cctttgccag cagaaacgac 1440 

ctcgggacat cgccgcagac ggtctgggca acgcccacat gggccacctc gacgagcgtc 1500 

gactacgccg ggaactcggt caagagcgtc gtccgcgtcg gcaacaccgc cggcacgcaa 1560 

caggtggcca tctcgtccga cggcggcgcg acgtggagca tcgactacgc ggccgacacg 1620 

tccatgaacg gcggcacggt ggcctattcg gccgacggcg acacgatcct ctggtcgacc 1680 

gcctcgtccg gcgtgcagcg ctcgcagttc cagggcagct ttgcctccgt ctcgagcctg 174 0 

cccgcgggcg ccgtcatcgc ctcggacaag aagaccaaca gcgtcttcta cgccggctcc 1800 

ggatcgacct tttacgtcag caaggacacc ggcagcagct tcacgcgcgg gcccaagctg 1860 

ggcagcgcag ggacgatccg ggatatcgct gctcacccga ccaccgcggg cacgttgtat 1920 

gtctcgaccg acgtcggcat attccgctcc acagactcgg gcacgacctt tggccaagtc 1980 

tccaccgccc tgaccaacac ctaccagatc gccctgggtg tgggctcagg ctcgaactgg 2040 

aacctgtatg ccttcggcac cggcccgtca ggggctcgcc tctacgccag tggagacagc 2100 

ggcgcctcct ggacggacat ccagggctcc cagggcttcg gctccatcga cagcaccaag 2160 

gtcgccggca gcggcagcac cgccgggcaa gtctacgtgg gcaccaacgg ccggggcgtc 2220 

ttttacgctc agggaaccgt cggcggcggc acgggcggga cttcctcgtc gaccaagcag 2280 

agcagcagca gtacctcttc cgccagctcg agcaccacgc tgaggtcgag cgttgtatcc 2340 



4 



acgacccggg cttcgacggt gacttcgtcg aggaccagct cggccgccgg tcccacgggg 24 00 
tcaggggtcg ccggtcatta tgctcagtgc ggagggattg ggtggacggg gccgacgcag 2460 
tgtgtggcgc cgtatgtctg ccagaagcag aatgattatt actaccagtg tgtgtga 2517 



5 



